Recognition of Characters from Streaming Videos
نویسندگان
چکیده
Over the past few years, Video has become one of the prime source for recreation, be it Television or Internet. Television brings a whole lot of professionally produced video content (International or local, sports or educational, news or entertainment) to the home for the masses. Similarly, Internet hosts a whole lot of video content uploaded by other users. Understanding the context of the video automatically can open up avenue for a lot of valueadded applications. Now, what do we mean by understanding the context? Video context is usually associated with audio, image, graph, text etc. that are embedded within the Video – these are the information that help us understand the content of the video. Video texts can provide a lot of contextual information on the video clips. In general, there are two types of texts embedded inside video namely, scene texts and artificial texts. Scene texts appear naturally in scenes shot by the cameras. Artificial texts are separately added to video frames (normally in Production Studios) to supplement the visual and audio contents (Lienhart, 1996). Since artificial text is purposefully added, it is usually more structured and closely related to context than a scene text. The text data in video frames contain useful information for automatic annotation, indexing and summarization of the visual information. Extraction of the text information involves the following processes – 1. Detection of Text Region 2. Localization of Text Region from the detected region 3. Tracking of Text from Localized Region 4. Extraction of Tracked Text 5. Enhancement of the Extracted Text 6. Recognition of the text from the Enhanced Input 7. Post-processing (language dependant) of Recognized Text
منابع مشابه
A Method to Reduce Effects of Packet Loss in Video Streaming Using Multiple Description Coding
Multiple description (MD) coding has evolved as a promising technique for promoting error resiliency of multimedia system in real-time application programs over error-prone communicational channels. Although multiple description lattice vector quantization (MDCLVQ) is an efficient method for transmitting reliable data in the context of potential error channels, this method doesn’t consider disc...
متن کاملContinuous Learning of Human Activity Models Using Deep Nets
Learning activity models continuously from streaming videos is an immensely important problem in video surveillance, video indexing, etc. Most of the research on human activity recognition has mainly focused on learning a static model considering that all the training instances are labeled and present in advance, while in streaming videos new instances continuously arrive and are not labeled. I...
متن کاملطراحی و پیادهسازی سامانۀ بیدرنگ آشکارسازی و شناسایی پلاک خودرو در تصاویر ویدئویی
An automatic Number Plate Recognition (ANPR) is a popular topic in the field of image processing and is considered from different aspects, since early 90s. There are many challenges in this field, including; fast moving vehicles, different viewing angles and different distances from camera, complex and unpredictable backgrounds, poor quality images, existence of multiple plates in the scene, va...
متن کاملLive coding youtube: organizing streaming media for an audiovisual performance
Music listening has changed greatly with the emergence of music streaming services, such as Spotify and YouTube. In this paper, we discuss an artistic practice that organizes streaming videos to perform a real-time improvisation via live coding. A live coder uses any available video from YouTube, a video streaming service, as source material to perform an improvised audiovisual piece. The chall...
متن کاملActive Learning for Online Recognition of Human Activities from Streaming Videos
Recognising human activities from streaming videos poses unique challenges to learning algorithms: predictive models need to be scalable, incrementally trainable, and must remain bounded in size even when the data stream is arbitrarily long. Furthermore, as parameter tuning is problematic in a streaming setting, suitable approaches should be parameterless, and make no assumptions on what class ...
متن کاملDetection and Recognition of Multi-language Traffic Sign Context by Intelligent Driver Assistance Systems
Design of a new intelligent driver assistance system based on traffic sign detection with Persian context is concerned in this paper. The primary aim of this system is to increase the precision of drivers in choosing their path with regard to traffic signs. To achieve this goal, a new framework that implements fuzzy logic was used to detect traffic signs in videos captured along a highway f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010